Image preprocessing for optical character recognition using neural networks

نویسنده

  • Rudolf JAKŠA
چکیده

Primary task of this master’s thesis is to create a theoretical and practical basis of preprocessing of printed text for optical character recognition using forward-feed neural networks. Demonstration application was created and its parameters were set according to results of realized experiments. Project definition and task determination 1. Write a introduction about the problematics of optical character recognition of characters and the methods of image preprocessing before optical character recognition. 2. Design a system for image preprocessing using neural networks. 3. Implement the designed system and simulator of printed text. 4. Realize experiments to determine the settings of the system and to compare the different approaches. 5. Evaluate the realized experiments and their possible practical use. 6. Write a documentation according to the supervisor’s instructions. Introduction Almost everyone who is working with computers has to input some text to the computer from the paper. There is not only one way to do that. The smartest 1 Obr. 1: This is what you get when you scan the printed text. way is to scan the document and let software for optical character recognition (shortened: OCR) transform the scanned image into editable text. The OCR software can use methods like: matrix comparation of image with letter examples from library feature extraction from image recognition of characters using neural networks hybrid and combined methods other methods Each method listed above has some advantages and disagvantages, so if you are using OCR software which uses any of those methods, you know what you can expect. Flexibility of the methods listed above varies from one to another, but even the less flexible method’s success can be improved using image preprocessing before the OCR. The most used methods of image preprocessing before OCR are: thresholding based on histogram smoothing other 2D matrix filters I won’t describe any of these methods of preprocessing here because they’re pretty much known to the public.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...

متن کامل

Image Pre-processing on Character Recognition using Neural Networks

This paper, presents a theoretical and practical basis of preprocessing on handwritten text for character recognition using forward-feed neural networks. Afterwards, the Feed forward algorithm gives working of a neural network followed by the Back Propagation Algorithm which compromises Training, Calculating Error, and Modifying Weights. The proposed solutions focus on applying Back Propagation...

متن کامل

Recognition of Handwritten Hindi Characters using Backpropagation Neural Network

Automatic recognition of handwritten characters is a difficult task because characters are written in various curved & cursive ways, so they could be of different sizes, orientation, thickness, format and dimension. An offline handwritten Hindi character recognition system using neural network is presented in this paper. Neural networks are good at recognizing handwritten characters as these ne...

متن کامل

OCR for Handwritten Kannada Language Script

The optical character recognition (OCR) is the process of converting textual scanned image into a computer editable format. The proposed OCR system is for complex handwritten Kannada characters. One of the major challenges faced by Kannada OCR system is recognition of handwritten text from an image. The input text image is subjected to preprocessing and then converted into binary image. Segment...

متن کامل

Offline Handwritten Character Recognition Using Neural Network

This paper is aimed at recognition of offline handwritten characters in a given scanned text document with the help of neural networks. Image preprocessing, segmentation and feature extraction are various phases involved in character recognition. The first step is image acquisition followed by noise filtering, smoothing and image normalization of scanned image. Segmentation decomposes image int...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006